A marginal mixture model for selecting differentially expressed genes across two types of tissue samples.

نویسندگان

  • Weiliang Qiu
  • Wenqing He
  • Xiaogang Wang
  • Ross Lazarus
چکیده

Bayesian hierarchical models that characterize the distributions of (transformed) gene profiles have been proven very useful and flexible in selecting differentially expressed genes across different types of tissue samples (e.g. Lo and Gottardo, 2007). However, the marginal mean and variance of these models are assumed to be the same for different gene clusters and for different tissue types. Moreover, it is not easy to determine which of the many competing Bayesian hierarchical models provides the best fit for a specific microarray data set. To address these two issues, we propose a marginal mixture model that directly models the marginal distribution of transformed gene profiles. Specifically, we approximate the marginal distributions of transformed gene profiles via a mixture of three-component multivariate Normal distributions, each component of which has the same structures of marginal mean vector and covariance matrix as those for Bayesian hierarchical models, but the values can differ. Based on the proposed model, a method is derived to select genes differentially expressed across two types of tissue samples. The derived gene selection method performs well on a real microarray data set and consistently has the best performance (based on class agreement indices) compared with several other gene selection methods on simulated microarray data sets generated from three different mixture models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Profound Transcriptomic Differences Found between Sperm Samples from Sperm Donors vs. Patients Undergoing Assisted Reproduction Techniques Tends to Disappear after Swim-up Sperm Preparation Technique

Background Although spermatozoa delivers its RNA to oocytes at fertilization, its biological role is not well characterized. Our purpose was to identify the genes differentially and exclusively expressed in sperm samples both before and after the swim-up process in control donors and infertile males with the purpose to identify their functional significance in male fertility. MaterialsAndMethod...

متن کامل

The Application of a Non-Radioactive DD-AFLP Method for Profiling of Aeluropus lagopoides Differentially Expressed Transcripts under Salinity or Drought Conditions

Aeluropus lagopoides is a salt and drought tolerant grass from Poaceae family, distributed widely in arid regions. There is almost no information about the genetics or genome of this close relative of wheat that stands harsh conditions of deserts. Differential Display Amplified fragment length polymorphism (DD-AFLP) led to the improvement of a non-radioactive method for which many parameters we...

متن کامل

A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments

MOTIVATION A common task in analyzing microarray data is to determine which genes are differentially expressed across two kinds of tissue samples or samples obtained under two experimental conditions. Recently several statistical methods have been proposed to accomplish this goal when there are replicated samples under each condition. However, it may not be clear how these methods compare with ...

متن کامل

MMAD: microarray microdissection with analysis of differences is a computational tool for deconvoluting cell type-specific contributions from tissue samples

BACKGROUND One of the significant obstacles in the development of clinically relevant microarray-derived biomarkers and classifiers is tissue heterogeneity. Physical cell separation techniques, such as cell sorting and laser-capture microdissection, can enrich samples for cell types of interest, but are costly, labor intensive and can limit investigation of important interactions between differ...

متن کامل

Investigating the Function of Predicted Proteins from RNA-Seq Data in Holstein and Cholistani Cattle Breeds

This study was performed to determine the digital expression profile of different genes expressed in Holstein and Cholistani breeds as well as to evaluate the performance of predicted proteins derived from differentially expressed genes between these two breeds using RNA-Seq data. For this purpose, the whole mRNA sequence for a blood sample of American Holstein and Pakistani Cholistani cattle p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • The international journal of biostatistics

دوره 4 1  شماره 

صفحات  -

تاریخ انتشار 2008